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SUMMARY 


The  principal  part  of  this  paper  is  devoted  to  the  study 
of  the  distribution  aad  density  functions  of  the  ratio  of  two 
normal  random  variables.  It  gives  several  representations  of 
the  distribution  fiction  in  terms  of  the  bivariate  normal 
distribution  and  Nicholson’s  V  function,  both  of  which  have 
been  extensively  studied,  and  for  which  tables  and  computational 
procedures  are  readily  available.  One  of  these  representations 
leads  to  an  easy  derivation  of  the  density  function  in  terms 
of  the  Cauchy  density  and  the  normal  density  and  integi*al.  A 
number  of  graphs  of  the  possible  shapes  of  the  density  are 
given,  together  with  an  indication  of  when  the  density  is 
unimodal  or  bimodal. 

The  last  part  of  the  paper  discusses  the  distribution  of 
the  ratio  (u^  +•••+  u^/Cv^  +•••+  v^)  where  the  u’s  and 
v’s  are  independent,  uniform  variables.  The  exact  distribu¬ 
tion  for  all  n  and  m  is  given >  and  some  approximations 


discussed 


1.  Introduction.  The  first  part  of  this  paper  will  discuss  the 
distribution  of  the  ratio  of  normal  random  variables;  the  second  part, 
the  distribution  of  the  ratio  of  sums  of  uniform  random  variables. 

There  does  not  seem  to  be  much  in  the  literature  concerning  the  ratio 
of  normal  variables  -  there  are  some  comments  by  Curtiss  in  his  paper, 
[2] ,  on  the  ratios  of  arbitrary  variates ,  and  papers  by  Fieller  [4] » 
and  Geary  [5],  all  of  which  are  quite  old.  It  might  be  thought  that 
the  subject  is  so  simple  that  it  was  considered  long  ago,  then  dropped, 
but  this  is  not  quite  the  case.  Unless  the  means  are  zero,  where  one 
easily  gets  the  Cauchy  distribution,  the  distribution  of  the  ratio  of 
normal  variables  does  not  respond  readily  to  the  devices  that  work  so 
well  for  other  important  quotients  in  statistics,  e.g.,  those  oi  t,  z, 
or  F.  Curtiss  remarks  that  it  is  apparently  impossible  to  evaluate 
the  density  in  closed  form,  a  rather  vague  statement.  We  will  derive 
the  exact  density  of  the  ratio  of  two  arbitrary  normal  variates  by 
what  might  be  called  modern  methods  -  not  in  the  sense  of  using  power¬ 
ful  new  techniques,  but  merely  by  using  properties  of  distributions 
that  have  been  extensively  studied  in  the  intervening  years.  The 
density  may  be  expressed  as  the  product  of  a  Cauchy  density  and  a 
factor  involving  the  normal  density  and  integral,  which  might  be  con¬ 
sidered  a  closed  expression  (equation  (5)  of  Section  2).  At  any  rate, 
there  are  now  available  a  number  of  methods  for  handling  the  functions 
associated  with  the  distribution  and  density  of  the  ratio,  and  with  the 
aid  of  a  computer,  we  may  study  them  in  detail. 

Aside  from  its  frequent  occurrence  in  problems  involving  the  ratio 
of  measured  quantities  with  a  random,  presumably  normal,  error,  the 
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problem  of  the  ratio  of  normal  variates  is  of  importance  in  regression 
theory.  In  fitting  a  l?ne  to  points  (x^y^) ,...,(xn»yn) ,  the  x’s 
assumed  constant  and  the  y's  independent  normal  with  E(y^)  =  a  +  px^, 
one  gets  a  and  p  as  estimates  of  a  and  p  by  least  squares.  It 
is  natural  to  estimate  the  x-intercept  of  the  regression  line  in  the  form 
-  a/p,  and  thus  the  problem  of  the  ratio  of  normal  variates  arises. 

The  following  example  of  this  problem  occurs  in  medicine:  in  order 
to  estimate  the  life  span  of  the  circulating  red  blood  cells  of  a  subject, 
a  number  of  his  red  cells  are  labelled  and  then,  by  some  means  or  other, 
the  number  of  labelled  cells  still  in  the  circulation  is  sampled,  say, 
every  5  days  for  50  days.  This  gives  a  sequence  of  points  which  are 
plotted  and  fitted  with  a  straight  line;  the  point  -./here  the  line  inter¬ 
cepts  the  time  axis  is  used  as  the  estimate  of  the  red  cell  life  span. 

It  is  important  to  know  the  distribution  of  this  estimate  about  its 
true  value  -  the  normal  red  cell  life  span  is  about  120  days  and 
shortened  life  spans  are  associated  with  various  hematological  disordeis, 
most  of  them  severe. 

We  will  discuss  the  distribution  and  density  of  the  ratio  of  two 

normal  random  variables  in  Section  2.  In  Section  4  we  will  discuss  the 

distribution  of  ratios  of  the  form  (u,  +•••+  u  )/(v,  +•••+  v  )  where 

the  u’s  and  v’s  are  independent  uniform  variables;  a  recent  paper 

[8] ,  on  this  distribution  for  n  =  m  =  3  led  to  its  being  considered 

here.  We  will  find  the  exact  distribution  for  all  n  and  m,  and 

examine  the  closeness  of  the  normal  approximation.  On  the  way  to  finding 

the  distribution  of  (u,  +•••+  u  )/(vn  +•••+  v  )  we  will  need  the  distri- 

i.  n  jl  m 

bution  of  a  linear  combination  of  uniform  variates;  some  comments  on  this 
distribution  and  its  history  are  in  Section  3 • 
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2.  Ratios  of  normal  variables  We  are  concerned  with  the  distribution 
of  the  ratio  of  two  normal  random  variables.  The  problem  has  been  dis¬ 
cussed  in  the  past,  [2,4,5]  •  We  will  bring  the  problem  up  to  date  in 
this  Section  -  give  an  explicit  representation  of  the  distribution  in 
terms  of  what  are  now  familiar  functions,  and  discuss  in  more  detail 
some  of  the  properties  of  the  distribuxion. 


Let 


(1) 


w 


=  aJLz. 

b  +  y 


where  a,b  are  non-negative  constants  and  x,y  are  independent  standard 
normal  random  variables.  It  is  easy  to  see  that  if  wf=  x^/y^  is  the 
ratio  of  two  arbitrary  normal  random  variables,  correlated  or  not,  then 
there  are  constants  c^  and  such  that  c^  +  c^w1  has  the  same 
distribution  as  v.  It  thus  suffices  to  study  the  distribution  of  (l)$ 
translations  and  changes  of  scale  will  provide  the  distributions  of 
the  general  ratio  x^/y^. 


The  set  of  points  (x,y)  for  which  <  t  is  a  region  bounded 

by  straight  lines,  and  the  normal  probability  measures  of  such  regions 
have  been  extensively  studied  in  the  past  few  years.  Ve  should  thus 
be  able  to  express  the  distribution  of  w  in  terms  of  functions  asso¬ 
ciated  with  those  measures j  particularly  the  bivariate  normal  distribu¬ 
tion  function 


L(h,k,p)  =  ?{%  >  h,  e  >  k] 


where  K  and  t>  are  standard  normal  with  covariance  p,  and  the  V 
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function  of  Nicholson  (ll] : 

h  q,x/h 

V(h,q)  =  55  <p(x)<p(y)dydx, 
0  0 


where  <p  is  the  standard  normal  density.  We  have 

P[v  <  t]  =  P[a  +  x  <  t(b  +  y)',b  +  y  >  0]  +  P[a  +  x  >  t(b  +  y)  ,b  +  y  <  0] 
=  P[-x  +  ty  >  a  -  bt,y  >  -b]  +  P[x  -  ty  >  -a  +  bt,y  >  b] 


=  U—Ezzb  -t, 

Al  + 12 


1  )  +  L( b,  - $ - ). 


ATt2 


Jl  +  t2  AT  t2 


Then  using  the  elementaiy  properties  of  the  L  and  V  functions  (see, 
for  example,  the  NBS  table  [10] ,p.  vii) , 

h  k 

L(-h,-k,p)  =  L(h,k,p)  +  5  ?U)dx  +  5  ?(x)dx, 

0  0 

L(-fc,-lc,p)  +  L(h,k,p)  =  2V(a,-k-"-c-h)  +  2V(k,— --&■)  +  |  +  3ln~lf, 

JlTf  JT^7 


we  have  several  representations  of  F(t)  =  P[~-~  <  t]: 


(2)  F(t)  =  L C4==>  )  +  U~==>  b> 


A  +  t5 


AT  r 


Ji  + 12  AT? 


(bt-a)//l+t2  b 

(3)  F(t)  =  5  <p(x)dx  +  5  tp(x)dx  +  2L( - ,  b, 


AT  t2 


AT  t2 


(4)  F(t)  =  T  r  tan‘lt  +  2 V( ------ >  T  a-)  -  2Y(b,a) . 

2  AT?  AT^ 


Representation  (4)  appears  best  for  numerical  purposes,  unless  b 
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Figure  1.  Graphs  of  the  density  of  (a  +  x)/(b  +  y) ,  where  a  >  0, 
b  >  0  and  x,y  are  independent,  standard  normal  random  variables. 
The  formula  for  the  density  is  in  equation  (5).  Values 
a  =  0/3,l/3,...,6/3  and  b  =  o/8,l/8,.. .,8/8  were  chosen  so  as 
to  represent  the  possible  shapes  of  the  density  function 


is  large,  say  b  >  3,  since  we  have  good  methods  for  providing  values 
of  V  and  —  tan  ^t,  [9],[10],  and  [l3]«  However,  when  b  is  large, 
the  second  and  third  terms  of  (3)  may  be  replaced  by  .5  and  0,  so 
that 


P[ 


a  +  x 
b  +  y 


<  t] 


.  (bt-*)//m^  (bt-a></l+? 

r  +  5  ®(x)dx  =  $  <p(x)dx 

*  Q  -oo 


provides  very  good  numerical  approximations  to  F(t) ,  plus  the  additional 
information  that  (bw  -  a)//l+v2  is  approximately  normally  distributed. 


Now  we  turn  to  the  density  of 
,  _  bt  -  a  _  b  +  at 

fl - ■ — ->  q  - 

+  t2  Jl  +  t2 


(a  +  :c)/(b  +  y) . 

.  2.  d  +  at 

h  b  -  at  * 


Let 


Using  primes  to  indicate  differentiation  with  respect  to  t,  so  that 
h’  =  q/(l  +  t2) ,  X*  =  -(a2  +  b2)/(bt  -  a)2,  we  differentiate  (4)  +o  get 


f(t)  =  F1 


1  q  b 

- — =-  +  2h«<p(h)  $  <p(y)dy  ->•  2X»  J 

tr(  1  +  t*)  0  0 


x<p(x)<p(Xx)dx. 


Integrating  the  last  term  and  simplifying,  we  get  this  form  for  f(t), 
the  density  function  of  the  ratio  ~ ~ 


(5)  f(t) 


-.5(a2+b2)  q 

+ ^  o  ,Cy)dy1, 


b  +  at 

9  '  T—  • 

Jl  +  t2 


Figure  1  shows  f(t) ,  the  density  of  (a  +  x)/(b  +  y) ,  for  various 

values  of  a  and  b.  The  curves  in  Figure  1  were  drawn  by  a  computer;  it 

fit  X 

also  drew  the  identification  for  each  density  in  the  form  £■ where 
a  is  a  multiple  of  j  and  b  a  multiple  of  g.  The  values  of  a  and 


Figure  2. 


The  denrMy  of  (a  +  x)/(b  +  y) 
according  to  the  region  of  the 
the  point  (a,b)  falls. 


is  unimodal  or  bin 
positive  quadrant  in 
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b  were  chosen  so  as  to  give  a  rough  indication  of  the  possible  shapes 
of  the  densities  given  by  fonnula  (5)«  As  you  can  see,  some  unusual 
shapes  are  encountered.  The  positive  a,b  quadrant  may  be  divided  into 
two  regions  according  to  whether  the  density  of  (a  +  x)/(b  +  y)  is 
unimodal  or  bimodal,  r.s  in  Figure  2.  The  curve  that  determines  the  two 
regions  is  asymptotic  to  a  =  2.257.  Thus  when  a  >  2.257,  the  density 
of  (a  +  x)/(b  +  y)  is  bimodal,  even  though  it  may  not  appear  so.  For 
example,  the  density  of  (10^  +  x)/(10^  +  y) ,  x  and  y  independent 
standard  normal,  would  appear  to  be  a  single  spike  at  t  =  1,  but  in 
fact  it  has  another  mode  somewhere  in  the  vicinity  of  t  =  -1012. 


We  conclude  this  Section  with  a  summary. 


a  +  x 


Summary  of  the  properties  of  the  ratio  w  =  ^  where  a  >  0,  b  >  0, 

x  and  y  independent  standard  normal.  ^ 

1.  If  w*  =  x-^/y^  ^he  r81^0  oi1  any  two  jointly  normal  variables, 
then  there  are  constants  c^  and  c?  so  that  c^  +  c^w  has  the  same 


distribution  as  w. 


2.  The  distribution  of  w,  say  F(t)  =  P[*-|  ■»  <  t] ,  may  be  expressed 
in  terms  of  the  bivariate  normal  distribution,  or  Nicholson’s  V  function 
in  several  ways  -  formulas  (2),  (3),  and  (4)  above. 

j  2 

3.  When  b  is  large,  say  b  >  3,  then  (ow  -  a)//l  v  is  approxi¬ 


mately  normally  distributed,  and  ^ 

(bt-a)//l+t2 

P[v  £  t]  =  <  t]  =  S  tp(u)du. 


4.  The  density  of  is  given  by  formula  (5).  This  density  is 

plotted  foT'  various  a  and  b  in  Figure  1. 
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^  41  y 

5.  The  density  of  —  la  unimodal  or  bimodal  according  to  the  region 

o  +  y 

of  Figure  2  in  which  (a,b)  lies.  When  a  >  2.257,  the  density  is  bi¬ 
modal,  although  one  of  the  inodes  may  be  insignificant. 


3.  The  distribution  of  c,u,  +•••+  c  u  . 

. . . . . - - - 11  n  n 


Let  u^,...,un  be  independent  random  variables,  each  uniformly 
distributed  over  the  interval  (0,1).  In  the  next  Section  ve  will 
need  the  distribution  of  a  linear  contribution  of  the  u’s. 


(6) 


c,u  +  c  u  +••*+  c  u 
11  2  2  n  n 


with  the  c’s  positive.  The  general  linear  form  in  the  u's  can 
readily  be  reduced  to  (6) ,  for  example 
3^  -  2u2  +  5u^ 

has  the  same  distribution  as 

3u^  -  2(1  -  u2)  t  5u^  =  3un  +  2u2  -*■  5u^  -  2, 

since  1  -  u2  has  the  same  distribution  as  u2* 


There  have  been  a  number  of  discussions  of  the  distribution  of  (6) 
in  the  literature  -  the  problem  (for  equal  c’s)  dates  back  to  Laplace 
[7],  who  solved  it  as  a  limiting  form  of  the  discrete  case*, and,  again 
with  equal  c’s,  the  result  is  in  standard  textbooks,  e.g.,  Uspensky  [17], 
who  inverted  the  characteristic  function,  and  Cramer  [l] ,  proof  by  suc¬ 
cessive  convolution.  For  unequal  c’s  the  result  was  given  by  Olds  [12], 
and  the  distribution  appeared  as  a  problem  on  volumes,  [3],  with  subse¬ 
quent  remarks  on  xts  proof  -  particularly  a  development  of  Schoenberg  [15] , 
using  recursive  relations  for  spline  curves. 

*The  discrete  case  of  the  problem,  which  may  be  viewed  as  the  problem  of 
finding  the  sum  on  n  "dice”,  each  one  having  a  certain  number  of  faces, 
has  an  even  more  curious  histoiy.  In  1710  Hontmart  solved  the  problem  for 
equal  dice,  as  did  DeMoivre  in  1711,  Simpson  in  1740,  LaGrange  around  1770,- 
and  Laplace  in  1774.  Montmart  attempted,  but  did  not  solve,  the  problem 
of  unequal  dice.  See  Todhunter’s  History  [16] , Articles  148,149,364,888,915,987. 
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More  recently,  Roach  [14],  offered  a  geometric  argument. 

Thus  the  problem  is  now  well  known,  and  it  is  not  particularly 
difficult,  although  rotational  difficulties,  plus  the  fact  that  the 
problem  may  be  viewed  as  one  of  probability,  geometry,  or  spline 
functions,  have  led  to  a  variety  of  proofs. 

Roughly,  the  distribution  of  c^u^  +#**+  cnun  be  described 

as  follows:  Let  S  be  the  set  of  all  2n  numbers  which  can  be  formed 
as  a  sum  of  different  c’s: 

S  =  {0,c1,...,cn,c1  +  c2,...,Cl  +•••+  cj. 


Then 


P[c^u^  +•  •■ 


+  c  u  <  a] 
n  n 


rile,  ♦  4  »c  n  > 
12  n  seS,s<a 


Z  +  (a  -  s)n. 


the  +  or  -  being  according  to  whether  there  are  an  even  or  odd 
number  of  c's  used  to  form  s .  For  example, 

P(2ux  +  3u2  +  8u3  <  7]  =  jr^y[73  -  (7  -  2)3  -  (7  -  3)3  +  (7  -  5)3] 

and 

(7)  P[2ux  +  3u2  +  8u3  <  12]  =  123  -  (12  -  2)3  -  (12  -  3)3  -  (12  -  8)3 

+  (12  -  5) 3  +  (12  -  10)3  +  (12  -  ll)3]. 

Note  also  that  the  distribution  of  2u^  +  3u2  +  8u^  is  symmetric 
(any  linear  combination  of  independent  symmetric  random  variables  is 
symmetric),  and  that,  rather  than  compute  expression  (7),  one  might 


consider 


P[2ux  +  3u2  *  8u3  <  12]  =  P[2(l  -  ux)  +  3(1  -  ^2)  +  8(1  -  u3)  <  12] 

=  P[2Uj  +  ^2  +  ^3  >  1^  =  1  “  3.(437  • 

We  nay  formally  describe  the  distribution  of  c^u^  +••♦+  cnun  as 
follows : 

Theorem  1.  Let  tti»'U2,U3,‘  **  ,Un  independent  random  variables ,  each  uni¬ 
formly  distributed  over  the  Interval  (0,1),  and  let  qi*°2' ' * ' yCn  be 
positive  constants.  Let 

Fn(a)  =  Prob[c1u1  +♦•  •+  <  a] 

and  let 


gn(x)  = 


nic-,c_.  ..c 


if  x  <  0, 


if  0  <  x. 


12  n 
Then,  for  0  <  a  <  +•♦•+  cn, 

rn(a)  =  gn(a)-  2  g^a-^)  *  ‘  +  ‘" 

The  theorem  may  be  easily  proved  by  induction,  using  the  elementary 
results : 


Wa)  =  J 

n+1  n+1  0 


n+1 


F  (a  -  :c)dx 
n 


'n+1 


—  J  gn(b  -  *)d*  =  gn+1»>)  -  gn+1(b  -  <W  • 


'n+1  0 


n+1 


and 


10 


When  the  c's  are  all  equal  to  1,  the  result  takes  the  following 

form: 


P[u..  +•••+ 


u 


n 


<  a]  =  i[an  -  (£)(a  -  l)n  +  (£)(a  -  2)n  •••], 


where  the  terms  are  taken  as  long  as  a, a  ~  l,a  -  2,***,  are  positive. 
More  formally,  fox-  0  <  a  <  n,  and  with  the  greatest  integer  notation, 

n  M 

P[u  +  •••-*■  ufl  <  aj  =  ^-  2  (-  l) 1(a  -  i)n. 

i=0 

xi  +. . .+  u 

4.  The  distribution  of  - - - - — — . 

-  V;L  +•••+  vm 

Let  un ,u  , . . . ,u  ,v, , . . . ,v  be  independent  random  variables ,  each 
12  n  l  m 

uniform  over  (0,1).  We  want  the  distribution  of 

u..  +•  •  *+  u 
_1 _ n 

(8)  v  +. . .+  v  * 

1  m 

The  distribution  of  (8)  is  of  interest  in  studying  round-off  error 
propagation  in  numerical  analysis,  see  [6],[l8].  The  particular  case 
m  =  n  =  2  was  worked  out  in  detail  in  [g].  We  will  find  the  distribu¬ 
tion  of  (8)  for  all  n  and  m,  by  applying  the  results  of  the  previous 
Section,  and  will,  in  addition,  discuss  approximations  to  the  distribution. 


Since  1  -  v.  is  distribi:ted  as  v. ,  we  have 

l  i 


u,  +•  •  •+  u  u..  +•  •  *+  u 

P[^-fTTTnf  <  *3  “  pt(l  -V.)  ♦'...+  (1  -VI  <  *1 
1  m  -l  m 


=  Pfu,  +•••+  u  +  av,  av,,  +•••+  av  <  mal 
1  n  1  2  m 


and  hence  a  direct  application  of  Theorem  1  gives,  (after  a  little  thought 
about  hew  the  terms  combine) : 


u,  +•  •  •+  u 

P[— -7 - : — ~  <  a]  = 

1  m 


n  [ma]  [(ma-i)/a]  ...  . 

- i - -V  2  (-  l)liJ(") (”)[(»,  -  j)a  -  i]n+I". 

(n  +  m)!a‘"  i=0  j=0  J 
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For  example, 


(0}  [(o)  (4-5)-L^-(^)(3.6)12+(|)  (2.7)-L'd-(|)  (l.8)J";:+(^)  (  •9)-l^] 
/7\r/5wo  „  \  12  4\12,/5w-.  _\l<!,/5\/  o\  12-i  j 


12  ,5, 


,12w5. 


Y(')[(J)(3o)i‘Mj)(2.6)1^(^)(l.7)x<i+(p(.8)^3 

PfU^^J<-^--^^(lH(50)(2.5)12-(^)(1.6)12+(5)(.7)12] 


-(^)  [(^)U-5)12-(|)(  .6)12] 


The  variate  (u^  +•••  +  un)/(v1  'f“*+  vm^  is  approximately  a  ratio 
of  independent  normal  variables,  and  the  discussion  of  Section  2 
should  apply.  We  may  derive  a  good  normal  approximation  directly, 
however,  writing 
u,  +• • •+  u 

Pf - . - . — H  <  a]  =  p[u_  +•••+  u  +  avn  av  <  ma] . 

Lvn  +*••+  v  x  J  '-1  n  1  m  J 


Since  the  sum  on  the  rignt  is  approximately  normal  with  mean  .5  in  +  ma] 
2  \ 

and  variance  (am  +  n)/l2,  we  have 


V1  +  +  Tffi  /  2  . 

•>/  a  m  -r  n 


Figure  3  gives  some  indication  of  the  merits  of  this  approximation. 
In  case  it  is  necessary  to  get  the  tail  of  the  distribution  with 
great  precision,  it  is  not  too  difficult  to  calculate  the  exact  pro¬ 
babilities:  for  0  <  a  <  jjj-, 

ui+***+u  n  n+nJ  m  n x n+m  t  fms ^n+m 

- t - <  aj  =  "T~TTlm  -  )  (m  -  1}  +  IJlm  -2) 

Lv,  +•••+  v  (n+m);  1  2 


and  for  b  >  n. 


>  »i  =  T^rV^rt^™  -  (?(» -  »nT“  -  (?(» -  *>■ 

1  m 


,  n+m  ,  / n\ 
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